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Abstract 

The Generalized Discretizable Molecular Distance Geometry Problem is a distance geometry prob- 
lems that can be solved by a combinatorial algorithm called "Branch-and-Prune" . It was observed 
empirically that the number of solutions of YES instances is always a power of two. We give a proof 
that this event happens with probability one. Keywords: distance geometry, symmetry, Branch-and- 
Prune, power of two. 



1 Introduction 

We consider the following problem arising in the analysis of Nuclear Magnetic Resonance (NMR) data 
for general molecules. 

Molecular Distance Geometry Problem (MDGP). 

Given an undirected graph G = (V, E) and a function d : E — > R, decide whether there is an 
embedding x : V — > IR 3 such that 

y{u,v} € E {\\x u - x v \\ = d uv ) (1) 



The MDGP is usually cast as a nonconvex unconstrained mathematical program min^r^ „i ES (||x« — 
1 1 2 — d^ v ) 2 , which can be solved using continuous Global Optimization techniques [1 . See [2l [3] for 
surveys. 

If the NMR data is obtained from a protein, it is usually possible to distinguish between backbone 
atoms and atoms of side chains. We can then attempt to place the backbone first [4] and the side chains 
later [5]. We therefore assume that G is the graph of the backbone. If a total order is given on V such 
that for each atom v there are at least three atoms that are adjacent to it in E and preceding it in the 
order (these are called the adjacent predecessors of v), if strict triangle inequality holds for d restricted to 
at least one triplet of adjacent predecessor for each vertex, and if spatial positions are given for the first 
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three atoms, there is only a finite number of possible embeddings compatible with the given distances, 
which can all be found using the so-called Branch-and-Prune (BP) algorithm [4 . The subset of MDGP 
instances for which the described atomic order exists is called Discretizable MDGP (DMDGP) 3 
and is known to be NP-hard [BJ. It was empirically observed that for any given practical instance, BP 
always finds a number of solution that is a power of two [1] . An isolated earlier counterexample, derived 
via a complexity reduction, given as Lemma 5.1 in [BJ, falsifies this conjecture. In this paper we give a 
formal description of the DMDGP in arbitrary dimensions (Sect. [2]), of the BP algorithm and some of its 
theoretical properties (Sect. |3]). We then study some geometrical aspects of the BP tree (Sect. [4]), and 
prove that the number of solutions of feasible DMDGP instances is a power of two with probability one 
(Sect. [5]). We also exhibit a family of counterexamples (Sect. [BJ that are much more intuitive than the 
one given in [BJ. 



2 The Discretizable Molecular Distance Geometry Problem 

The generalization of the MDGP to arbitrary dimensions asks for an embedding of G in ~R K satisfying 
(|T]) and is called the Distance Geometry Problem (DGP). The generalization of the DMDGP to 1*- 
replaces triplets with if-uples of adjacent predecessors and strict triangle with strict simplex inequalities 
[7]. For a set U = {xi G K | i < K + 1} of points in R^, let D be the symmetric matrix whose (i,j)-th 
component is \\xi ~ Xj\\ 2 for all i,j < K + 1 and let D' be D bordered by a left (0, 1, ... , 1) T column 
and a top (0, 1, ... , 1) row (both of size K + 2). Then the Cayley-Menger formula states that the volume 

Ak[U) of the X-simplex on U is given by Ak{U) — J ^(KVf ^'l- The strict simplex inequalities are 

given by Ak(U) > 0. For K — 3, these reduce to strict triangle inequalities. We remark that only the 
distances of the simplex edges are necessary to compute Ak(U), rather than the actual points in U ; the 
needed information can be encoded as a complete graph J^k+i on K + 1 vertices with edge weights as 
the distances. 

Let n = \V\ and m = \E\. For all v € V, let N(v) = {u G V \ {u,v} G E} be the star of vertices 
around v (also called the adjacencies of v); for a directed graphs (V, A), where A C V x V, we denote 
the outgoing star by N + (v) = {u G V | (v, u) G A}. For an order < on V, let j(v) = {u G V | u < v} be 
the set of predecessors of v, and let p(v) = \j(v)\ + 1 the rank of v in <. For V' C V, we denote by G[V] 
the subgraph of G induced by V . For a finite set M, let V(M) be its power set. We call an embedding 
x of G valid if |T]) holds for G. For a sequence x = (xi,..., x n ) and a subset U C {1, . . . , n} we let x[U] 
be the subsequence of x indexed by U. If x is an initial subsequence of y, then y is an extension of x. 

Generalized DMDGP (GDMDGP). Given an undirected graph G = (V, E), an edge weight 
function d : E — >• R+, an integer K > 0, a subset Vb C V with \Vq \ = K, a partial embedding 
x : Vo — >• M. K valid for G[Vb], and a total order < on V such that: 

{v G V | p(v) < K} = V ; 

VveV (p(v) > K -4 \N(v) n j(v)\ > K); 

to e V n Vb ^Uv C iV(u) n 7(u) (Gr[f7„] = K K A 

AK-x{U v )>QAVueU v {p{v)-K<p{u)<p{v) - 1)), 

decide whether there is a valid extension x : V — > of i. 

Conditions ([2EJ) allow the search for vertex v to only depend on the K vertices of rank preceding p(v), 
as x v is the intersection of at least K spheres centered at x u and with radius d uv for all u G U v . This, 
in particular, implies that the predecessors of v are placed before v, so that all of the distances between 
all predecessors are known when placing v. Thus, we can replace the condition G[t/„] = K-k in © by 
\U V \ = K. We remark that eliminating the condition to G U v (p(v) — K < p(u) < p(v) — 1) from ^ 
and fixing K gives rise to a problem called the Discretizable Distance Geometry Problem in K 



(2) 
(3) 

(4) 
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dimensions (DDGPa'), discussed in [8,, which is itself a subset of instances of the DDGP, where the 
number of dimensions is not fixed. The DMDGP is therefore the subset of the GDMDGP for which 
K = 3 [BJ. In summary, we have the following diagram (where arrows represent the C relation). 



DDGP 3 



- MDGP 



DMDGP 



DGP 



GDMDGP 



DDGP 



A polynomial reduction from Subset-Sum to the DMDGP shows that the GDMDGP is NP-hard. Even 
restricting the range of d to the set of rational numbers, it is not clear whether there are YES certificates 
for the DDGP that have polynomially bounded size. 

In the following, we assume that the probability of any point of 1^ belonging to any given subset of 
M. K having Lebesgue measure zero is equal to zero. 



3 Branch-and-Prune 

The BP algorithm for the DMDGP, presented in [3], can easily be generalized to the GDMDGP. As 
mentioned above, once the vertices of U v have been embedded in M. K , the known distances from vertices 
in U v to a given v will enforce the position of v as the intersection of K spheres. Under strict simplex 
inequalities, this intersection consists of at most two distinct points. The BP exploits this fact to re- 
cursively generate a binary search tree of height at most n where a node at level i represents a possible 
placement in M. K of the vertex of G with rank i in <. Paths of length n correspond to valid embeddings. 

Let G be a GDMDGP instance. Consider v eV with rank p(v) = i> K, let G v = G[y(v) U {v}] and 
i be a valid embedding of G[7(u)]. We characterize the number of extensions of x valid for G v in the 
following lemmata (which also hold for the DDGP); the proof technique for Lemma 13.11 is well known 
[9J [TU] ; we repeat the proof here because we believe the explicit form ([7]) is unpublished. 

3.1 Lemma 

If \N(v) n j(v)\ = K then there are at most two distinct extensions of x that are valid for G" . If one 
valid extension exists, then with probability 1 there are exactly two distinct valid extensions. 

Proof. Since \N(v) fl 7(v)| = K , U v — N(v) D j(v) and v is at the intersection of exactly K spheres in 
M. K (each centered at x u with radius d uv , where u € U v ). The position z £ M. K of v must then satisfy: 

\fueU v \\z -x u \\ = d uv => ||z|| 2 - 2x u ■ z+ \\x u \\ 2 = d^ v . (5) 

As in |llj . we choose an arbitrary w € U v , say w — max< U v , and subtract from the Eq. ([S]) indexed by 
w the other equations of (O, obtaining the system: 

V?i £ U v \ {w} 2{x u -x w )-z = (\\x u \\ 2 - d 2 uv ) - {\\x w \\ 2 - d 2 wv ) 
\\z\\ 2 - 2x w ■ z+ \\x w \\ 2 = d 2 wv . 

The system ([6j consists of a set of K — 1 linear equations and a single quadratic equation in the K- 
vector z. We write the linear equations as the system Az = b, where the (w,j)-th component of A is 
2(x U j — x W j) 7 the u-th component of b is ||ir M || 2 — ll^tull 2 — d 2 v + d 2 0V , A is (K — 1) x K and b G R K_1 . 
By strict simplex inequality, A has full rank (for otherwise ^2 U ^ W A tl (x tl — x w ) — implies that x w is 
in the span of {x u \ u € U v }, and hence that Ak-i(U v ) = 0); so without loss of generality assume 
that the square matrix B formed by the first K — 1 columns of A is invertible. Let zb be the vector 
consisting of the first K — 1 components of z; then the linear part (first K — 1 equations) of ([BJ yields 
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zb = B l {b — Nzk), where N = 2(x u k — x w k \ u € U v \ {u>}) G K 1 . After replacement of in (JSJ) 
with zb(zr-), we obtain the following quadratic equation in zk ■ 

(\\N\\ 2 + 1)4 - 2({b + x wB )N + x wK )z k + {\\x wB -If + x 2 wK - d 2 wv ) = 0, (7) 

where b = B~ 1 b and N = B~ X N . If the discriminant of © is negative then no extension of x to v 
is possible and the result follows. If the discriminant is nonnegative, (Q has solutions z' K ,z'^ yielding- 
points z' — (zb(z' k ), z' k ) and z" = (zb{z'^), z'^) G M. K , which are distinct with probability 1 because the 
discriminant is zero with probability 0. The extended embeddings, distinct with probability 1, are given 
by (x.z') and {x,z"). □ 

3.2 Lemma 

If \N(v) n 7(v)| > K then, with probability 1, there is at most one extension of x. 

Proof. Consider a subset S C N(v) D 7(u) such that \S\ = K + 1 and 5 2 U v . Either there is at least 
one point x v such that (x, x v ) is an embedding of G[S U {v}] that is valid w.r.t. the system: 

Vue S ( x lk ~ ZxukXvk + xl k ) = d 2 uv , (8) 

or the system has no solution. In the latter case, the result follows, so we assume now that there is a 
point x v satisfying (|5]). Since the points x u are known for all u £ S, © is a quadratic system with K 
variables and K + 1 equations. As in the proof of Lemma 13 . 1 1 we derive an equivalent linear system 
from ([5]). Since d satisfies the strict simplex inequalities on U v with probability 1 and S D U v , by [T2] 
{x u u G S} are not co-planar and the system has exactly one solution. □ 

3.3 Lemma 

With the notation of Lemma \3.1[ if x is a valid embedding for G[U V ], then z" is a reflection of z' with 
respect to the hyperplane through the K points of x. 

Proof. Any sphere in WL K is symmetric with respect to any hyperplane through its center; so the inter- 
section of up to K spheres in R K is symmetric with respect to the hyperplane containing all the centers. 

□ 

Reflections with respect to hyperplanes are isometries, and can therefore be represented by linear 
operators. If a G M. K is the unit normal vector to a hyperplane H containing the origin, then the reflection 
operator Rq w.r.t. H can be expressed in function of the standard basis by the matrix / — 2aa T , where 
/ is the K x K identity matrix [13) . If H is a hyperplane with equation a T x = ao, with ao ^ 0, s.t. x is 
ordered and a^, for some 1 < i < K, is the nonzero coefficient of smallest index in a. Then, the reflection 
operator R acting on a point p G M. K w.r.t. H is given by R(p) — Ro(p — ^r^i) + ^ e i, where G M. K is 
the unit vector with 1 at index i and elsewhere: we first we translate p so that we can reflect it using Rq 
w.r.t. the translation of H containing the origin, then we perform the inverse translation of the reflection. 

A formal description of the BP algorithm for the GDMDGP (which also holds for the DDGP) is given 
in Alg. [1] It builds a binary search tree T = (V, ^4), directed from the root to the leaves, whose nodes are 
triplets a — (x(a), A(a), p{a)). For a G T we denote by p(a) the unique path from the root node r of T 
to a; x{a) is an extension of the embedding x~ found on p(a~), where a~ is the unique parent node of a. 
Next, A(a) G {0, 1} distinguishes whether a is a "left" or a "right" subnode of a~ . More precisely, let a 
be a node at level % in T, v = x be a partial embedding of G[U V ], and x = a v o be the equation of 

the {(K — l)-dimensional by ©) hyperplane through the points of x. Assuming u = p~ x {i — 1), a v G M. K 
is oriented so that a v ■ a u > 0; then: 

JO if ajx(a) < a v0 
I 1 if a„ x(a)i > a v0 . 
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Lastly, /Lt(a) = B if x is a valid extension of x _ , in which case the node is said to be feasible, and /i = B 
otherwise. This allows us to retrieve the set X of all valid embeddings of G by simply traversing T 
backwards from the leaf nodes marked EH up to r. 



Algorithm 1 The Branch and Prune algorithm. 



Require: Partial embedding x of first K vertices of G 
Ensure: Set X of valid embeddings of G 
Let a = (x\, 0, EH) and a' = (x\, 1, EH) 
Initialize V = {a, a'} and .4 = {(r, a), (r, a')} 
for 1 < i < K do 

Let a = (Si,0,ffl), a' = (s is l,B), /3 = (x 4 _i,0,ffl) 
Let V <- V U {a, a'} and A <- A U a), (ft, a')} 
end for 

Branch AndPrune (K + 1, (x K ,Q,3i)) 
Let X = {x(6») | 9 G V A |iV+(0)| = A /i(6>) = ffl} 
stop 

function BranchAndPrune(i, /3): 
if i>nV|U = B then 

return 
end if 

Let v = /5 _1 («) 

Choose U v such that Eq. dU holds 

Compute the equation ajx = a v o of the hyperplane through x[U v 
Let Z = {z 1 , z"} be extensions of x{0) to v, and Z' = Z 
for zeZdo 

if H{u,w} G £ \\x(P) u - z\\ ^ d uv then 

Let Z = Z \ {z} 
end if 
end for 

if Z= {z',z"} then 
if ajz' < a v o then 

Let a = (z',0,m), a' = (z",l,ffl) 
else 

Let a = (z", 0, EH), a' = (z', 1, EH) 
end if 
else if Z = {z} then 
if al z < a v o then 

Let a = (z, 0, EH), a' = (Z' \ {z}, 1, B) 
else 

Let a = (z, 1, B), a' = (Z' \ {z}, 0, B) 
end if 
else 

return 
end if 

Let V <- V U {a, a'} and ^ <- .4 U {(/3, a), (^, a')} 
for 6» G A r+ (/3) such that fj,(6) = B do 

BranchAndPrune(i + 1, 0) 
end for 
return 



We remark that Alg. [T] differs from the original BP formulation [4] because it applies to K dimensions 
and stores the binary search tree. In the rest of this section we list some of the elementary properties of 
the BP algorithm. 
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3.4 Lemma 

At termination of Alg. [TJ X contains all valid embeddings of G extending x. 

Proof. Z exists with probability 1 by Lemma 13.11 Every embedding in X is valid because of Steps [TH] 
and l2"0ir2"Tl No other valid extension of x exists because of Lemmata I3.HI3.2I □ 

We now partition V in pairwise disjoint subsets Vi, . . . , V n where for all i < n the set Vi contains all 
the nodes of V at level i of the tree T. 

3.5 Proposition 

With probability 1, there is no level i < n having two distinct feasible nodes (3,9 S V* such that 
\{a e N+(0) | /i(a) = EB}| = 1 and \{a € N+{6) | /j,(a) = EB}| = 2. 

Proof. We show that for all i < n the event of having two distinct nodes 0, 9 S Vi, with = v, such 

that /? has one feasible subnode and 9 has two has probability 0. Consider T v = N(v) Cij(v): if \T V \ = K 
then by Lemma I3TT1 8 should have exactly two feasible subnodes with probability 1; since it only has one, 
the event \T V \ = K occurs with probability 0. Since \T V \ > K by the event \T V \ > K occurs with 
probability 1. Thus by Lemma 13 . 2 1 1 here is at most one valid embedding extending the partial embedding 
at v, which means that the two feasible subnodes of 9 represent the same embedding, an event that occurs 
with probability 0. □ 



4 Geometry in the BP Trees 

Here we exhibit some geometrical symmetry properties between the valid embeddings of a GDMDGP 
instance. We prove in Thm 14.81 that for any valid embedding y 6 X, if the BP tree branches at level i on 
the path to y, then the embedding obtained by reflecting all the nodes after level i is also valid. 

To this aim, we need to emphasize those BP branchings which carry on to feasible leaf nodes along 
both branches. For y G X and a vertex v € V \ Vq we denote T(y,v) the following property: 

T(y,v): there are feasible leaf nodes 8,f3' G V n such that x{8) = y, p(/3) fl V p ( l ,)_i = p(/3') n 
V /5( „ ) _ 1 and p(0) n V p(v) ^ pQ3') n V p{v) . 

If T(y,v) holds, it is easy to show that p(/3) n V p ( t ,)_i contains a single feasible node with two fea- 
sible subnodes. With T(y,v) true, we let R v be the Euclidean reflection operator with respect to 
the hyperplane through y[U v ] (as defined on p. 0}. Define R v = Z^W- 1 x (R v ) n -p( v ) , i.e. R v y = 
(yi, . . . , yi-i, R v yi, ■ ■ ■ , R v y n ). We remark that for all « < £ < n and for all a£V< the set p(a) fl Vi has 
a unique element. 

4.1 Corollary 

Let a e Vj-i for some i > 1, v = and N + (a) = {n,(3} with ^(77) = fi(8) = EH. Then x(n) v — 

R v x(8) v . 

Proof. This is a corollary to Lemma 13.31 □ 

4.2 Lemma 

Let a e V,_i for some i > 1 such that N + (a) — {n',8'}, u = v > u with p(v) = I, and consider 

two feasible nodes 77, e Vt such that v( = p(rj) n Vi and 8' — p(8) n Vi. Then, with probability 1, the 
following statements are equivalent: 
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(») V % < j < £, x(/3") w = R u x(n") w , where rf = p(n) n Vj, f3" = p(/3) n V jt and w = p^d); 
(n) Vi<j<£, \(r)") = 1 - X(/3"), with rj" = p(rf) n V 3 and /3" = p(/3) n Vj. 



Proof. Let a" x = a° , aj x = a* be the equations of the hyperplanes H v , Hp defined respectively 
by x(t])[U v ] and x(/3)[U v ], with the normals oriented as explained on pageSJ We prove by induction on 
£ — i that the following assumption is equivalent to (i) and (ii) 



(Hi) for all i < j < £, x(fi") w = R u x(n") w and a u ■ a° w = a u ■ a} w , where rj" = p(rj) n Vj, (3" = p(/3) n Vj, 
w — p~ 1 (j), and a^, and a4 are the normal vectors of the hyperplanes H v » and Hpn oriented as 
usual. 



If I = i, then (i)\ and (Hi) hold simultaneously. Indeed, r\ = rj' and f3 = (3', hence x(fi) v = R u x(rj) v 



(Lemma 

„0 



and X(n) = 1 - X(f3) (Alg-ED Steps and |28) . In addition, we have H n = R u Hp, therefore 



I . Because the orientation of a° , a}, is such that a n 



i}. > 0, the result holds. 



Assume that the equivalence stated above holds for level £ - 
I. In the sequel, denote t = p~ x (l — 1). 



1 , we show that it is still the case at level 



(I)|<S>p)| Suppose for all i < j < £, x(/3") w = R u x(n"), 
hypothesis, both statements are equivalent). Hence, H^n 
points generating the hyperplanes either belong to H a 
in particular if we choose rj" , j3" G Vi-\. In addition, if we use the induction hypothesis (i) 

and A(n) 



and A (77") = 1 - \{fi") (by the induction 
= R u Hpn holds for all j, because the K 
or are reflections of each other. This is true 



we have a, 



SO GSj. 



(in) ), 



are directed similarly w.r.t a v 



1 — A(/3) if and only if 



x((3) v = R u x( V ) v (see Fig.QJ. 




(a) a° x(n) v > a° Q and oj x(fi) v > oj (b) a° x(-q) v > a° and a* x(fi) v < oj 

Figure 1: Proof of Lemma [4.21 Case (fTa|) shows the contradiction deriving from X(n) = X((3) = (or 
x (P)v 7^ R u x(r]) v ), and case (jlb|) the situation that actually occurs. 



(ii) =>■ (Hi) Suppose for all i < j < £, X(rj") = 1 — A(/3"). By the previous result, we also know 
that i < j < £, x((3") w = R u x(n") w . It remains to prove that a u ■ a° = a u ■ a\, i.e. that the angles 
9® and 9\ formed by these vectors have the same cosine. Notice once again that H n — R u Hp. By 
induction, we know that the angles 6®, 8\ formed by a u and respectively a®, a}, have same cosine. With 
probability 1, the hyperplanes H^, Hp are not parallel, hence their normal vectors cannot be identical, 
therefore, 6® = —8\ (see the illustration on Fig. [2]). Denote 9°, 9 1 the angles formed respectively by a® 
and a°, and by a\ and a\. We also have, H v „ = R u Hp„, where n'\f3" £ Ve-i, hence the normal vectors 
of these 4 hyperplanes are also symmetric, which implies 8° = —9 1 or 9° = ir — 1 . By the definition of a° 
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Figure 2: Proof of Lemma |4~2"1 illustration of the fact that a u ■ a® = a u ■ a\. 



and a\ (page IU, since the scalar products are positive, — ir/2 < 9°,9 1 < tt/2, thus 9° — —6 1 . Therefore, 
6° — 9® + 9° — —9} — 9 1 = — 0j, which concludes this part of the proof, (m) =>■ (i) Obvious. □ 



4.3 Proposition 

Consider a subtree T' of T consisting of K + 2 consecutive levels i — K — 1, . . . , i (where i > 2K + I), 
rooted at a single node r\ and such that all nodes at all levels are marked EEL Let p — 2 K+1 and consider 
the set Y' = {yj \ j < p) of partial embcddings of G at the leaf nodes {otj \ j < p} of T' . Let 
u = p _1 (i — K — 1) and v = /3 _1 (i). Then with probability 1 there are two distinct positive reals r,r' 
such that \\yj((Xj) u - yj(otj) v \\ G {r, r'} for all j < p. 



Proof. Fig. [3] shows a graphical proof sketch for K = 2. With a slight abuse of notation, for a vertex 




Figure 3: Proof of Prop. 14.31 in R 2 . The arrangement of three segments gives rise, in general, to two 
distances r, r' between root and leaves. 

W 6 V in this proof we denote by R w the set of all reflections at level w. We order the otj nodes so that the 
action of R v on (ai, . . . ,a p ) is the permutation FJ . mod 2= iOjJ + !)• Let t = p~ l (i — 1). Since all nodes 
are feasible, \\yj(oij) v — yj(otj)t\\ — d tv and \\yj{ctj) u — Uj(oij)t\\ — d ut for all j < p (we remark that {t,v} 
and {u,t} must be in E by the definition of the GDMDGP). With probability 1, the segments through 
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Uj( a j)u and yj(oij)i (where j < p) do not respectively lie within the hyperplanes defining the reflections 
R v ; and the same holds for the segments through yj{(Xj)t and yj(otj) v . Thus, there is a set Q of positive 
reals n, . . . , r p s.t. for all j < p with j mod 2 = 1 we have \\yj(ctj)u — Ui(oij) v || = r j and ||j/j+i(aj+i) u — 
yj+i(aj + i) v \\ = fj+i, which shows \Q\ < p — 2 K+1 . By Lemma T4.2I the action of R* on (a%, . . . ,a p ) is 
the permutation rj . mod 4=1 {j,j + 3)(j + 1, j + 2): this implies that fj = and ry+i = for all j 
mod 4=1, which shows |Q| < p/2 = 2 K . Inductively, for a vertex w s.t. i — K < p(w) < i — 1 the action 
of ^ ^ EL mod 2 ,- P w +1 (i,i + 2-^)+ 1 - l)(j + 1, j + 2 l -"(-)+ 1 - 2) • • • (j + T-P^ - 1, j + 2*-"(™)), 
which implies that |Q| < 2 K+1 Therefore p(w) = i- K proves that |Q| < 2. The case \Q\ = 1 can 

only occur if yj(atj) u , yj(oij)t and yj{oij) v are collinear for all j < p, an event that occurs with probability 
0. □ 



Prop. I4~51 is useful in order to show that certain configurations of nodes within T can only occur with 
probability 0. 

4.4 Example 

Consider a subtree T' of T like the one in Fig. O embedded in M. 2 , and suppose that all nodes at level 
u, w, t are marked EE, and further that only one node within cti, «2 is feasible, only one node within 0:3, 0:4 
is feasible, only one node within aj,as is feasible, and a^,a% are both infeasible. This must be due to a 
distance d u > v with u' < u. Consider now a circle C completely determined by its center at yi(ai) u > and 
its radius d u > v ; if C also contains the points at the nodes a\,a 4 , as or the points at the nodes 0*2, 03, ct-j 
then we must have u' — u, in which case also one of as, ciq will be feasible (against the hypothesis). And 
the probability that C should contain the points at the nodes ct\, 013, as or 0:2,0:4, ot-j is zero. Hence T' 
can only occur with probability 0. 

4.5 Corollary 

Consider a subtree T' of T consisting of K + q + 1 consecutive levels i — K — q, . . . ,i (where i > 2K + q 
and q > I), rooted at a single node ij and such that all nodes at all levels are marked EEL Let p = 2 K+q 
and consider the set Y' — {yj \ j < p} of partial embeddings of G at the leaf nodes {ctj j < p} of T' ■ 
Let u = p~ l (i - K — q) and v = p~ l (i). Then with probability 1 there is a set H uv = {r, | j < 2«} of 2 q 
distinct positive reals such that \\yi(ai) u — yi(cti) v \\ & H uv for all i < p. 



Proof. The proof of Prop. 14.31 can be generalized to span an arbitrary number of levels by induction on 
q. Two distances r 3l , rj 2 S H uv can only be equal by collinearity of some subsets of points, an event 
occurring with probability 0. □ 



4.6 Corollary 

Let y € X and v € V \ Vo such that T(y, v) holds. If {u, w} G E with u < v < w and p(w) — p(u) > K 
then d uw € H uw with probability 1. 

Proof. Since T(y, v) holds, then the GDMDGP instance is YES and there must exist at least two feasible 
nodes at level p(w) in T. If d uw ^ H uw the probability that a completely determined sphere contains 
two arbitrary points in M. K is zero. Since the instance is a YES one, however, the BP algorithm does not 
prune all feasible nodes due to d uw . By Cor. 14.51 the only remaining possibility (which therefore occurs 
with probability 1) is that d uw g H uw . □ 



4.7 Corollary 

Let y £ X and v G V \ Vo such that T(y,v) holds. If u G V with u > v then R v y u belongs to a valid 
extension of y[U v ]. 

Proof. If there is no edge {w, u} G E with p(u) — p(w) > K the result follows by Cor. 14.11 Otherwise, 
by Cor. 14.61 d wu G H wu . As in the proof of Prop. POl all pairs of points that are feasible w.r.t. d wu are 
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reflections of each other w.r.t. R v . □ 
4.8 Theorem 

Let y £ X and v G V \ Vq such that T(y, v) holds. Then R v y G X with probability 1. 

Proof. We have to show that R v y is a valid embedding for G = (V,E). Partition E into three subsets 
Ei,E 2 ,E 3 , where Ex = {{t,u} £ E | t,u < v}, E 2 = {{t,u} £ E \ t, u > v} and E 3 = {{t,u} £ E \ t < 
v A u > v}. For Ei, by definition \\(R v y) t — (R v y) u )\\ — \\Iy t — Iy u \\ = \\yt — Vu\\ — d tu as claimed. 
For E 2 , \\(R v y) t - (R v y) u )\\ = \\R v yt - R v Vu\\ = \\ut- Vu\\ = d tu because R v is an isometry. For E 3 , 
we aim to show that \\Iyt — R v y u \\ — d tu . Since y £ X, by Lemma [3.41 there is a feasible leaf node 
a with x(a) — y. Because T(y,v), 3?/ G V p ( 1 ,)_ 1 such that x{rj) — y[^{v)} and N + (r]) = {/?,/?'} with 
/i(/3) = = ffl; we can assume without loss of generality that p(a) D V p ( 1J ) = {/?}; furthermore, again 

by T(y, u), there is at least one feasible leaf node a' such that p(a')nV p ( v ) = {/?'}■ Let {w} = p(a) n V p ( u ) 
and {w'} = p(a') (~1 V p („). Because u/ is feasible, ||x(w') t — x(uj') u \\ = d tu ; because rj is an ancestor of both 
a and a' at level p(v) — 1 and t < v, p(a') n V p (t) = p(ct) H V p (t) , which implies that x(u/)t = x(w)t = j/j. 
Thus, ||y t — y u \\ — d tu = \\yt — x(ui') u \\. Furthermore, because /?' G p(w') fl V p (^), x(w') extends x(/3'). 
By Alg. [U Steps and HH1 A(^) = 1 - A(/3'). Because a is feasible, at every level p(v!) £ V such 
that v < v! < u the node 9 £ p(a) fl V p („') has / G {1,2} feasible subnodes; by Prop. 13.51 the node 
& G p(a') fl V P ( U ') also has / feasible subnodes. If / = 2, by Cor. 14.71 it is possible to choose a' so that 
\{0') = 1 — \{9) with probability 1; if / = 1 then by Alg. [TJ Steps l32l and l34l all feasible nodes inherit 
the same A value as their parents, so X(6') = 1 — X(9). By Lemma T4.21 x(uj') u — R v y u with probability 
1. Hence \\yt — R v y u \\ = dt u as claimed. □ 



5 Symmetry and Number of Solutions 

Our strategy for proving that feasible GDMDGP instances have power of two solutions with probability 
1 is as follows. We map embeddings y G X to binary sequences x € {0> l} n describing the "branching 
path" in the tree T. We define a symmetry operation on \ by flipping its tail from a given component i 
to its end (this operation is akin to branching at level i). We show that the cardinality of the group of 
all such symmetries is a power of two by bijection with a set of binary sequences. Finally we prove that 
the cardinality of the symmetry group is the same as \X\. 

For all leaf nodes a G V with p(a) = EH let x( a ) — (MP) I ft 6 P( a ))j since embeddings in X are also 
in correspondence with leaf EH-nodes of T by Alg. [TJ Step [51 x defines a relation on X x {0, l} n . 

5.1 Lemma 

With probability 1, the relation x is a function. 

Proof. For x to fail to be well-defined, there must exist an embedding x which is in relation with two 
distinct binary sequences x'iX") which corresponds to the discriminant of the quadratic equation in the 
proof of Lemma 13. II taking value zero at some rank > K, which happens with probability 0. □ 

Let 5 = {x(y) | y G X}. For y G X let y l be its subsequence (xi, . . . , Xi). We extend x to be defined on 
all such subsequences by simply setting x l = (x(y)ii ■ ■ ■ > x(j/)i); x(v) is valid if y is a valid embedding. 

Let N = {1, . . . ,n} and g be the n x n binary matrix such that gij = 1 if i < j and otherwise 
(the upper triangular n x n all-1 matrix); let be its i-th. row vector and T = {gi \ i G N}. Consider 
the elementwise modulo-2 addition in the set F£ (denoted ®): this endows F£ with an additive group 
structure with identity e = (0, . . . , 0) where each element is idempotent. Thus, Q = (FJ, ©) — C 2 ■ This 
group naturally acts on itself (and subsets thereof) using the same © operation. It is not difficult to 
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prove that T is a set of group generators for Q and a linearly independent set of the vector space V given 
by Q with scalar multiplication over Fa. For all S C N, let 

9S = ®ft= 

and define a mapping : V{N) — > Q given by <ft(S) = gs- 

5.2 Lemma 

</) is injective. 

Proof. We show that for all S,T CN, if g s = g T then S = T. 





9s 


9t 




■ ©ft = 


©ft 






»eT 






e 




ieS ieT 




idempotency 


* © Si © ft = 

ies ieT 


e 


ft © ft = gf 


> © ft® © gf = 

ieSAT iesnT 


e 


idempotency 


> © ft = 

ieSAT 


e 


linear independence = 


► SAT = 







> S = 


T. 



This concludes the proof. □ 

5.3 Lemma 

For all H CT,\(H}\ = 2l H L 

Proof. The restriction of function <^> to V{H) is injective by Lemma l5.2l Furthermore, each element g of 

(if) can be written as © gi for some S 1 £ if because if is a spanning set for the vector space if over F£ , 

ies 

which is setwise equal to the group (if). Thus </) is surjective too. Hence 4> is a bijection between V(H) 
and (if), which yields the result. □ 

Let I be the set of levels of T for which from all nodes with two valid children there is a path going 
to a feasible leaf through both children. Let L = {ft G T | i 6 f } and A = (L) be the subgroup of Q 
generated by L. 

5.4 Theorem 

If S ^ 0, for all £ £ 2 we have £ © A = S with probability 1. 

Proof. (=>) We show that £ffi A C 3 with probability 1; because (L) = A it suffices to show that £©(?i € 5 
for an arbitrary <7, £ L, i.e. that there exists a valid embedding w £ AT such that xi w ) = £ © ft- Let 
j/ £ X _1 (C) an d u = P 1 ^) such that T(y,v), and define to = ff^y (where i? 11 is defined in Thm. 14.81 
above); by Thm. 14.81 w £ X. Let a' be the leaf node of T such that x(a') = y; by Lemma 13^41 there is a 
leaf node (3' such that x(/3') = w. We have to show that for all I > i the node j3 £ p(/3') n Ve is such that 
A(,5) = 1 — A(a), where a is the node in p(a') H Vg. We proceed by induction on £ For I = % this holds 
by Lemma 1331 For £> i, the induction hypothesis allows us to apply Lemma I4T21 and conclude that the 
event A(a) = 1 — A(/3) occurs with probability 1. 

(<=) Now we show that S C £ © A with probability 1, i.e. for any ij £ S there is g £ A with S,® g = i]. 
We proceed by induction on n, which starts when n = AT + 1: if K + 1 g" i then |H| = 1, £ = and the 
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theorem holds; if K + 1 € / then |3| — 2, L = {gn+i} and the theorem holds. Now let n > K + 1; for 
all j G + 1, . . . , n - 1} define ~? = {%> \ £ G 3} and I? = {g e G T \ t G I A I < j}. By the induction 
hypothesis, for all f G S- 7 ' (£' © (L J ) = S 3 ). Now, either n £" I or n £ /; by Prop. 1X51 with probability 
1 if n $ I then nodes in V n _i can only have zero or one feasible subnode (let B™ be the set of all such 
feasible subnodes), and if n 6 I then nodes in V n -i can only have zero or two feasible subnodes /3 (let 
-BJ be the set of all such feasible subnodes). In the former case we let 5™ = {£,(x(f3)) | f3 G and 
L n = L n-i. in the latter we let E n = {^(x(fi)) | j3 G BJ} and i" = i™- 1 U {.g„}. In both cases it is easy 
to verify that the theorem holds for 5", L n : in the former case it follows by the induction hypothesis, and 
in the latter case it follows because g n — (0, . . . , 0, 1), namely, if r\ G 3 and n G I then take £ — rj® g n 
(the result follows by idempotency of g n ). □ 



5.5 Corollary 

Jf a GDMDGP instance is feasible, \X\ is a power of two with probability 1. 



Proof. By Lemma [5.11 y is a function with probability 1. Let x, x' £ X be distinct; then by Alg. [TJ 
Steps [551 HH1 EH and |3H the map x : X — > 3 is injective. By definition of 3 it is also surjective, hence 
\X\ = |5|. By Thin. 1511151 = |x©A| for all x G 5 with probability 1. It is easy to show that |x©A| = |A|, 
so by Lemma |5.3I \ X\ is a power of two with probability 1. □ 



6 Counterexamples 

We discuss a class of counterexamples to the conjecture that all GDMDGP instances have a number 
of solutions which is a power of two (also see Lemma 5.1 in [B]). All these counterexamples are hand- 
crafted and have the property that two distinct embeddings x,x' have at least a level i where Xi = x' it 
which is an event which happens with probability 0. For any K > 1, let n = K + 3, V = {1, . . . , n}, 
E = iih j} | < i - j < K} U {{1, n}} and <kj = 1 for all {i, j} G E. The first n - 2 = K + 1 points can 
be embedded in the vertices of a regular simplex in dimension K; then either x„_i = x\ or x n -\ is the 
symmetric position from x\ with respect to the hyperplane through {x2, ■ ■ ■ , x n -2}- In the first case, the 
two positions for x n are valid, in the second only x n = xi is possible (see Fig. [4] for the 2-dimensional 
case), yielding a YES instance where |A| = 6. 



„(00) 




„(°) 



X3 
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-(1) 



_ (0) 

X\ — x^ 



(a) Positions of the points on the plane. 
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(b) BP tree. 



Symmetric 



Figure 4: The counterexample in the case K — 2. Embeddings a; 
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x 5 is not. 
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,( n ) 



are valid, while 



Lastly, Fig. [S] shows an example where the (ii) => (i) implication of Lemma T4.2I fails for instances in 
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DDGP \ GDMDGP. This shows that any generalization of our result to the DDGP is not trivial. Let 
V = {1, ... ,6} (the graph drawing is the same as the embedding in R 2 ). The nodes 5', 6' linked with 
dashed lines show alternative node placements. Let Us = {3,4} and U§ = {1,2}. The line through the 
points 3, 4 does not provide a valid reflection mapping 6 to 6'. This happens because U§ does not consist 
of the two immediate predecessors of 6. 




Figure 5: A counterexample to Lemma H?2] applied to DDGP \ GDMDGP. 
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